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Enumeration of saturated chains in Dyck lattices 

Luca Ferrari* Emanuele Munarini^^ 



Abstract 

We determine a general formula to compute the number of saturated chains in Dyck 

P\J ' lattices, and we apply it to find the number of saturated chains of length 2 and 3. We also 

compute what we call the Hasse index (of order 2 and 3) of Dyck lattices, which is the ratio 

^^ ' between the total number of saturated chains (of length 2 and 3) and the cardinality of the 

^ , underlying poset. 

1 Introduction 

O 

en ■ Given a poset, a very natural problem is to count how many saturated chains it has. A 

saturated chain in a poset is a chain such that, li x < y are consecutive elements in the chain, 

^^ I then y covers x. In the present paper we wish to address this problem in the case of Dyck lattices. 

r) • The Dyck lattice of order n, to be denoted P„, is the lattice of Dyck paths of semilength n whose 

associated partial order relation is given by containment: given 7,7' G T>n, it is 7 < 7' when, in 

the usual two-dimensional drawing of Dyck paths, 7 lies weakly below 7'. Some papers studying 

properties of Dyck lattices are |FMH IFPj . Counting saturated chains of length 1 is clearly 

equivalent to enumerating edges in the associated Hasse diagram, which has been considered in 

[FM2] not only for Dyck lattices but also for other lattices of paths. Here we start by providing 

a general formula for counting saturated chains of length h, for any fixed h, in a given Dyck 

f^ — ■ lattice. Next we deal with the cases h = 2,3, giving for them detailed enumerative results. We 

^^ ! also define the notion of Hasse index of order h (thus generalizing the concept of Hasse index 

proposed in |FM2j ) and compute such an index in the two mentioned special cases. 



m 

^ ; 2 Preliminaries 

In this section we collect some notations and results which will be used in the sequel. 

p\ . A Young tableau is a filling of a Ferrers shape A using distinct positive integers from 1 to 

» I n = |A|, with the properties that the values are (strictly) decreasing along each row and each 

column of the Ferrers shape. Here |A| denotes the number of cells of the Ferrers shape A. This 
constitutes a slight departure from the classical definition, which requires the word "increasing" 
instead of the word "decreasing". However, it is clear that all the properties and results on 
(classical) Young tableaux can be translated into our setting by simply replacing the total order 
"<" with the total order ">" on N. A skew Young tableau is defined exactly as a Young tableau, 
with the only difference that the underlying shape consists of a Ferrers shape A with a (possibly 
empty) Ferrers shape /x removed (starting from the top-left corner), in such a way that the 
resulting shape is strongly connected: this means that every pair of consecutive rows has at least 
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one common column and every pair of consecutive columns has at least one common row (such 
a shape will be also called a skew Ferrers shape). 

A Dyck path is a path starting from the origin of a fixed Cartesian coordinate system, 
ending on the x-axis, never going below the x-axis, and using only the two steps u = (1, 1) 
and d = (1,-1). A valley (peak) in a Dyck path is a pair of consecutive steps du (ud). The 
semilength of a Dyck path is just half the number of its steps. The set of all Dyck paths of 
semilength n will be denoted -D„. 

The set Dn endowed with the partial order described in the Introduction will be called the 
Dyck lattice of order n and denoted V^- The generating series of saturated chains of length h 
in the family of Dyck lattices will be written SCh{x), whereas the number of saturated chains 
of length h in P„ (i.e. the coefficient of x" in SCh{x)) will be written sch{T>n)- 

At the end of this section, we propose a generalization of the notion of Hasse index given 
in [FM2j . Recall that the Hasse index i{V) of a poset V is given by i{V) = tWj where i{V) is 
the number of covering pairs in V. Given a positive integer h, we now define the Hasse index of 
order /i of P as ih^P) = ^^pp\ ; where schCP) denotes the number of saturated chains of length 
h of the poset V. Of course ii{V) = OV)- For instance, for the Boolean algebra Bn having 
2"- elements, sch{Bn) can be computed by taking an arbitrary subset having k elements (for 
< A; < n) and then adding any h of the remaining elements in a specified order. Equivalently, 
we can choose a subset having h elements, a linear order on it, and a subset of its complement. 
Therefore we get 

schiBn) = J2 (l) (^ - ^)^ = H^2"-\ 

fc=0 ^ ^ 

where {a)h = a ■ {a — 1) ■ . . . ■ {a — b+ 1) denotes a falling factorial. 
Thus, the Hasse index of order h of Bn is given by 

. .« ^ (n),, . 2^-^ (n). 



2" 2^ 

We will say that the Hasse index of order /i of a sequence ^ = {T'o,T'i,T'2, ■ ■ ■ j'Pm ■ ■ ■} 

^^ and asymptotically Boolean when ihCPn) ~ 2^ 



of posets is Boolean when ihCPn) = -^jt" and asymptotically Boolean when ihCPn) ~ ^rr (or, 



which is the same, ihiVn) ~ 2^)- 

In the computation of the Hasse index we will also use the well known Darboux theorem (see 
for instance [BLLj), which asserts that, given a complex number ^ 7^ and a complex function 
f{x) analytic at the origin, if f{x) = (1 — x/^)~"-ip{x), where ip{x) is a series with radius of 
convergence R > \^\ and a {0, —1, —2, . . .}, then 

3 The general enumeration formula 

Let 7*-'^^ < 7*^"'^'' < • • • < 7^^*' be a saturated chain (of length h) in P„. It is easy to see that 
two consecutive paths of the chain only differ by a pair of consecutive steps, namely a valley (a 
peak) in the smallest (largest) one. More generally, the minimum 7'^^' and the maximum j^^' 
differ by a set of steps in such a way that the sum of the areas of the regions delimited by 
these steps is equal to h. To be more precise, this means that the two paths can be factorized 

CO) (0) (0) (0) J (h) (h) (h) (h) , r 

as 7'^'''' = ai7| a272 "^Tfe «fc+i aiid 7 = "i7i "272 "fc7fc "fc+i, where, for every 

i, the two factors 7^^ and 7^^ have the same length, and the sum of the areas of the regions 
determined by the pairs of factors (7,- , 7^ ) is equal to h (see Figure [T|) . 




Figure 1: A pair of Dyck paths 7 (thick) and 7' (dashed), with 7 < 7', and the corresponding 
set of skew Ferrers shapes. 



Each of the regions determined by the pairs (7]^ ,7^^ ) can be regarded as a skew Ferrers 
shape. To fix notations, we will suppose that such a shape is that obtained by rotating the sheet 
of paper by 45° anticlockwise. Referring again to Figure [U the pair of Dyck paths on the left 
determines the pair of skew Ferrers shapes on the right. 

Now suppose to select a saturated chain from 7'^' to 7^'*'. This corresponds to choosing, 
one at a time, the h cells belonging to the skew Ferrers shapes described above. More formally, 
this defines a linear order on the set of all cells of the skew Ferrers shapes determined by the 
two paths such that, on each row and on each column, cells are in decreasing order. This means 
that a saturated chain essentially generates a set of skew Young tableaux. 

Let now 7 G P„. We want to determine the number of saturated chains of length h starting 
from 7 in 2?„. According to the above considerations, to describe any such chain we start by 
giving a partition A = (Ai, . . . , A^) of h. Next we have to choose a set 71, . . . , 7^ of factors of 7 
such that, for any i < k, we can build a skew Ferrers shape (pi on 7^ having area Aj. Finally, to 
determine the saturated chain, we just have to linearly order the cells of the Ferrers shapes thus 
obtained, or, equivalently, to endow each of the shapes with a skew Young tableaux structure. 

Now we will try to describe more formally the above argument. Denote by SkFS the set of 
all skew Ferrers shapes. Given ip G SkFS, we write A(ip) for the area of ip, i.e. the number of 
cells of (f. We also define SkFS{n) = {c^ S SkFS \ A{ip) = n}. Given a set of words 71, . . . , 7„ 
on the alphabet {u,d}, we say that they are a set of pairwise disjoint occurrences (p.d.o.) in 
7 when they appear as factors of 7 having no pairwise intersection. A skew Ferrers shape ip 
is delimited by two paths, both starting at its bottom left corner and ending at its top right 
corner. Each of such paths can be seen as a word on {u,(i}, by simply encoding a horizontal 
step with the letter d and a vertical step with the letter u. The word having d as its first letter 
is called the lower border of ip and is denoted b{ip). Finally, for any given ip G SkFS, let t{ip) 
be the number of skew Young tableaux of shape ip. 

For any path 7 G P„, let A = (Ai, . . . , A^) be a partition of the positive integer h (this will 
also be written as A h /i). Next we have to choose a set 71, . . . , 7^ of pairwise disjoint occurrences 
in 7 such that, for any i < k, there exists a skew Ferrers shape ipi S SkFS{\i) for which 
b{ipi) = 7j. Now, to get a saturated chain, we have to select a A:-tuple {ipi, . . . ,(pk) G SkFS^ 
such that b{ipi) = 7^ and A{ipi) = Aj, and for each component ipi we have to choose one among 
the t{ipi) possible skew Young tableaux. Finally, since the set of integers actually used to fill 
in the cells of each ipi can be any possible set of |Aj| integers less than or equal to h, we have 
proved the following result. 

Theorem 3.1 The number schiVn) of saturated chains of length h of the lattice Vn is given 
by the following formula: 
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Ah/l 7l,---,7fcP-d.o. 

(Vi)(3vieSfcFS(Ai))6(¥'i)=7i 



(¥.i,...,</'fc)SSfcFS'= 



h 

A{(pi),...,A{ipk) 



t{ipi)---t{^k). (1) 



In the rest of the paper, our main ahn is to apply the above formula to the special cases 
h = 2 and h = 3 (the case h = 1 having already been examined in [FM2j ) , thus finding some 
new results on the poset structure of Dyck lattices. 

We end the present section by recalling that this problem could also be tackled from a 
slightly different point of view. Indeed, given two Dyck paths of the same length 71 and 72 such 
that 71 < 72 , the set of all saturated chains between 71 and 72 can be represented by means of a 
suitable Polya festoon, more precisely a Polya festoon whose components cannot be —polygons 
(see [F]). It seems that this approach could be more elegant, but should lead to more difficult 
computations. 

We also remark that, in the paper |CPQS| , pairs of noncrossing free Dyck paths (also called 
Grand-Dyck paths in different sources) are considered, also in connection with several different 
combinatorial structures, such as noncrossing partitions and vacillating tableaux. It could be 
of some interest to extend our results to the case of free Dyck paths and successively interpret 
them on the above mentioned combinatorial objects via the bijections described in |CPQS| . 

4 Saturated chains of length 2 

In order to apply formula ([1]) to the case of saturated chains of length 2 we simply have 
to set h = 2. Doing this way, one immediately observes that there are only two partitions of 
2, namely (1,1) and (2), and that there exists one pair of "admissible" skew Ferrers shapes of 

area 1, i.e. (D, D), and two different skew Ferrers shapes of area 2, i.e. CD and CJ . Since each of 
these shapes can be endowed with only one Young tableau structure, we arrive at the following 
result. 

Proposition 4.1 The generating series for the number of saturated chains of length 2 of Dyck 
lattices is given by 

SC2{x) = XI I 5Z (^ ■ ^('^^' "^""^-y + #iddu)-y + #{duu)^) j x"", (2) 

n>0 \-yeV„ J 

where with #(71, . . . ,7^)7 we denote the number of pairwise disjoint occurrences of the ji 's in 

7- 

All we have to do now is to evaluate the three unknown quantities appearing in ([2]). The 
following proposition translates formula ^ into an expression more suitable for computing. 

Proposition 4.2 Denote with F{q, x) and V{q, x) the generating series of all Dyck paths where 
X keeps track of the semilength and q keeps track of the factor duu and of the factor du (i.e. 
valleys), respectively. Then 



SC2{x) = 2 ■ 
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dq 
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q=l 



dq^ 



(3) 
9=1 



Proof. The expression 



dV_ 

dq 



gives the generating series of Dyck paths with respect to 

q=l 



the number of valleys. Analogously, the expression ^-|- gives the generating series of Dyck 



dq'^ 



q=l 



paths with respect to the number of (non ordered) pairs of valleys. Moreover, the expression 
^ gives the generating series of Dyck paths with respect to the number of factors duu. 



Since the factors ddu and duu are obviously equidistributed in the set of Dyck paths, formula 
([3]) immediately follows. ■ 

We are now in a position to find a neat expression for the generating series SC2{x). 

Theorem 4.1 The generating series for the number of saturated chains of length 2 of Dyck 
lattices is given by 

„„ . ^ V^ /-T^ ^ n 1 - 6x + 6x^ - (1 - 4x)\/l -4x 

where 

, , (2n\ (n- l)(n-2) , , 

Proof. Let G{q,x), H{q,x) be the generating series of Dyck paths starting with a peak 
and Dyck paths starting with two consecutive up steps, respectively, where x keeps track of the 
semilength and q keeps track of the factor duu. Since any non-empty Dyck path 7 decomposes 
uniquely as 7 = Uj'D'j", where 7', 7" G T), and 7" starts either with a peak or with two 
consecutive up steps (if it is not the empty path), we arrive at the following system (where F 
is defined as in the previous proposition): 

' F = 1 + xF{l + G + qH) 

G = x{l + G + qH) (6) 

^H = x^F{l + G + qHf. 
Solving for F, we find the following expression: 



1 - 2(1 - q)x - a/1 - 4x + 4x2 - Aqx'^ 
^^'^'") = 2fx • 

Moreover, the explicit expression of V{q,x) (see again the previous proposition) can be 
found in [Dl IFM2J , and is the following: 



^ ^ 1 - (1 - g)x - yi - 2(1 + q)x + (1^^)^ ^ 

2qx 
We can therefore apply the previous proposition, thus obtaining formula Q. ■ 

The integer sequence associated with 5(72 (x) starts 

0,0,0,4,30,168,840,3960,18018,80080,.... We observe that the terms of the above se- 
quence divided by 2 yield sequence A002740 of [S]. In terms of Dyck paths, this sequence gives 
the sum of the abscissae of the valleys in all Dyck paths of semilength n — 1. It would be nice 
to have a combinatorial explanation of this fact. 

The results of the present section allow us to compute the Hasse index of order 2 of Dyck 
lattices. Recall that in [FM2J it is shown that the Hasse index of order 1 is asymptotically 
Boolean. 

Proposition 4.3 The Hasse index of order 2 of the class of Dyck lattices is asymptotically 
Boolean. 

Proof. Since \Vn\ = (^);^! from formula ([5]) we get 

. ,^ , sc2{Vn) (n-l)(n-2)(n + l) n^ 

which means precisely that the Hasse index of order 2 is asymptotically Boolean. ■ 



5 Saturated chains of length 3 

Setting /i = 3 in ([1]) we obtain a formula for the enumeration of saturated chains of length 
3 of Dyck lattices. Similarly to what we did in the previous section, we observe that there are 
three partitions of the integer 3, namely (1,1,1), (2,1) and (3). Moreover, the unique "admissible" 
triple of skew Ferrers shapes of area 1 is (D, D, D) , whereas there are two pairs of skew Ferrers 
shapes whose first component have area 1 and whose second component has area 2, namely 

(n, tj) and (n, I I l ), and there are four skew Ferrers shapes having area 3, i.e. CJ , I I I I , IZtzl 

and tn . Unlike the previous case, now we have two shapes (of area 3) each of which can be 
endowed with two different Young tableaux structures. More precisely, we have to consider the 

two skew Young tableaux [III] , [III] and the two (skew) Young tableaux [If , EP . Thus, a direct 
application of formula ([T|) leads to the following statement. 

Proposition 5.1 The generating series for the number of saturated chains of length 3 of Dyck 
lattices is given by 

SC3{x) = y^ y^ (6 • #{du, du, du)^ + 3 • #{du, ddu)^ 

n>0 7eX'n 

+ 3 • #{du, duu)^ + #{dddu)^ + #{duuu)-y 
+ 2 • #{dduu)^ + 2 • #{dudu)y) x". 

Our next step will be the evaluation of the unknown quantities appearing in ([S]). 



(8) 



of 



Analogously to the case of saturated chains of length 2, we start by finding an expression 
8]) better suited for computation. 



Proposition 5.2 Denote with A{q,x), B{q,x) and C{q,x) the generating series of Dyck paths 
where x keeps track of the semilength and q keeps track of the factors dduu, dudu and duuu, 
respectively. Moreover, let V{q, x) he defined as in the previous section. Finally, let F{y, q, x) 
be the generating series of Dyck paths obtained from the series F{q, x) defined in the previous 
section by adding the indeterminate y keeping track of valleys (i.e. of the factor du). Then 



5^3 (x) 



2- 



+ 



'dA 
dq 

dq^ 



+ 2 



9=1 



+ 6- 



g=i 



dB 
dq 



d^F 
dydq 



<?=! 



+ 2 

dF' 
dq 



dC 
dq 



g=l 



(9) 



y=q=l 



Proof. We start by observing that the knowledge of the generating series F{y,q,x) allows 
us to compute the term of ([5]) associated with the pair (du, duu). Indeed, it is clear that, if we 
differentiate F with respect to y and q and then evaluate at y = (7 = 1, we obtain the generating 
series of Dyck paths with respect to semilength and number of pairs {du, duu). However, in this 
way we are going to consider also those pairs in which the valley du is part of the factor duu. 
Thus, to obtain what we need, we have to subtract the derivative of F with respect to q, then 
evaluate aX, y = q = 1, which yields the expression 

d'^F dF' 
dq 



dydq 



y=q= 



Moreover, it is clear that the generating series describing the distribution of the pair 
{du, ddu) is the same, and this explains the coefficient 6 in front of the above displayed ex- 
pression in formula Q. 



6 



Finally, the meaning of the partial derivatives of the generating series A, B and C are 
obvious (notice, in particular, that the the factors dddu and duuu are clearly equidistributed, 
so they are both described by series C), as well as the triple partial derivative of V evaluated 
in g = 1, which gives 6 times the distribution of triples of valleys in Dyck paths. ■ 



Theorem 5.1 The generating series for the number of saturated chains of length 3 of T>n is 
given by 

, P{x) - Q{x)^Jl-Ax 



SC3{x) = Y,sc3{n 

n>0 

where P{x) = 1 - 13x + b9x^ - lOOx^ 
1 - llx + 39x2 _ 4Q^3 _ 22x4. 

The coefficients sc^iVn) can be expressed as 



x{l — 4x)^ 
lOOx'^ + IQx* + 64x^ = (1 - 4x)^(l - x - x"^) and Q{x) 



(10) 



SC3{T>n) 



2n\ {n^ - 7n + 2){n - 2) 
n) 4(n+l)(2n-l) 



(n > 2). 



Proof. We start by considering the generating series F, G, H defined in the previous 
section. Similarly to what we did in the above proposition, we need to add an indeterminate 
y which will keep track of valleys. Thus, in the following, we will have F = F(y,q,x), and the 
same for G and H. 

Using the same decomposition of Dyck paths described in Theorem 14. 11 we can now rewrite 
system ([B]) taking into account the presence of the indeterminate y, thus obtaining 



F = 1 + xF(l + yG + yqH) 
G = x{l + yG + yqH) 
[H = x^F{l+yG + yqH)\ 



fill 



The solution of such a system is the following: 



p ^ l-il+y-2yg)x-^{l+2y+y^~4yq)x'2-2{l+y)x+l 

2yqx 

r - (l-(l+y)^-VA)(l-(l+;/-2OT)x+\/A) 
<j" — j—rz — TT^rm — rvT!) 



R 



^jqx~Ay'^q(\ — q)x'^ 
(-l+(l+i/)3;+\/A)(l-(l+;/-2OT)x+v^) 
2yqx{'4yqx—Ay'^q{\—q)x'^^ ' 



where A = 1 - 2(1 + y)3; + (1 + 2y + y^ _ Ayq)x'^ . 

The expression of F allows us to compute the term of ([8]) associated with the pair (du, duu): 



d'^F dF 



dydq dq 



y=q=l 



-2 + 15x - 30x2 j_ ^Q^3 + (2 - llx + 12x2)Vl - 4x 
2x(l - 4x)Vl - 4x 



Recalling the expression of the generating series V reported in ([7|), we obtain: 



dq^ 



q=l 



3(1 - llx + 40x2 - 50x3 _^ iQ^i _ (1 _ 9x + 24x2 - 16x^)71 - 4x) 

x(l -4x)2Vl - 4x 



The generating series A and B can be easily computed starting from the functional equations 
they satisfy, which can be found in |STTj and are reported below for the reader's convenience: 



x{q + (1 - q)x)A^ - (1 + (1 - g)(x - 2)x).4 + 1 - (1 - g)x 
xS2 + ((1 - q){x - l)x - l)B + (1 - g)x + 1 = 0. 







More precisely, we obtain the following expressions: 



-l+2(l-g)a:-(l-g)x-^ + ^l-4a-+2(l-g)x-2+(l-g)2^ 

-2x {q+{l-q)x) 

l+{l-(j)x'-(l-g)2)2-^l-2(l+g)a'-{5-4g-ij2)a;2-2(l-g)2a'3+(l_g)2 

2a; 



A{q,x) 
B{q,x) 

Differentiating with respect to q and evaluating at g = 1 we then obtain: 



(12) 



dA 

dq 

dB 



dq 



9=1 
9=1 



l-5a+5a:2-(l-3a+x2)v/l-4a 

2xVl-4x 
l-'ix-(l-x)y/T^^ix 
2y/T^^Ax 



Instead the computations related to the generating series C are a little bit more complicated. 
Again in |STT| we find the following functional equation satisfied by C: 

qxC^ + (3(1 - q)x - l)C'^ - (3(1 - q)x - 1)C + (1 - q)x = 0. 

Differentiating both sides with respect to q and then solving for ^ yields: 

xC^ - 3xC^ + 3xC - x 



dC 

dq 



' SqxC^ + 2(3(1 - q)x - 1)C - 3(1 - q)x + l' 



Now, evaluating at q = 1 and recalling that C{l,x) 
Catalan numbers, we get the following: 



- — ^ — - is the generating series of 



dC_ 
dq 



9=1 



-1 + 6x - 9x2 _^ 2x^ + (1 - 4x + 3x'^)^/l-4x 
x{l -4x - \/l - Ax) 



We finally have all the information needed to compute SC^^x) using ([9|), and we obtain 
formula ()10p . A careful algebraic manipulation of this series yields the stated expression for the 
coefficients 503(2?^). ■ 

The integer sequence SC3(P„) starts 0, 0, 0, 2, 38, 322, 2112, 12210, 65494, 334334, .... Neither 
this sequence nor such a sequence divided by 2 appear in [S]. 

Proposition 5.3 The Hasse index of order 3 of the class of Dyck lattices is asymptotically 
Boolean. 

Proof. Since we have not fully explained the computations needed to derive the coefficients 
sc3(2?n); we will provide a proof independent from the explicit knowledge of such coefficients. 
Since series (1101) can be rewritten as: 



ill 

X 



X — X 



Q{x) 



(l-4x)5/2_ 

when n is sufficiently large we have 

sc^{Vn) = W']SC^{x) = -K+i]Q(x)(l - Ax)-'>'\ 
Using Darboux's theorem, we get 

Q(0 (n + 1)5/2-1 



SC3(Pri 



e' 



n+l 



r(5/2) 



where S, = \- Since Q{^) = ^ and r(|) = -^, we obtain 

SC3{T>n) ^ . 

Recalling that IP^I ~ — ^, we finally have 

SC3(i:'„) n^ 



^3(^n) 



|2?„ 



6 Conclusions and further work 

We have derived a general formula for the enumeration of saturated chains of any fixed 
length h in Dyck lattices. However, we have applied such a formula only when h is small 
(namely h = 2,3). When h becomes bigger, computations become much more complicated. Is 
it possible to conceive a different approach more suitable for effective computation? 

We have proved that the Hasse indexes of order 1,2 and 3 of Dyck lattices are asymptotically 
Boolean. The obvious conjecture is that the Hasse index of any order h is asymptotically Boolean 
too. 

Is it possible to extend our approach to enumerate chains in Dyck lattices? 

The problem of enumerating (saturated) chains can also be posed for other classes of posets. 
In this context, it would be interesting to find analogous results in the case of Motzkin and 
Schroder lattices. 
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